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ABSTRACT 

A 4-year longitudinal experiment conducted in 
Tennessee examined class-size effects on student achievement in 
kindergarten through grade 3. The Student Teacher Achievement Ratio 
(STAR) project included more than 7,000 students per year in 79 
schools in school systems. Class size categories were: small class 
(13-17 students), regular class (22-26 students), and regular class 
with full-time teacher aide. Students and teachers were randomly 
assigned to class categories. The study found that students in small 
classes made significantly (statistically and educationally) greater 
gains than other students. In addition, minority students in small 
classes benefitted more than minority students in other class 
categories. It was also determined that gains achieved in 
kindergarten were maintained through third grade. Analyses showed a 
continuing, powerful class-size effect in all locations. However, no 
consistent teacher-aide effect was evident in the study. The Lasting 
Benefits Study (LBS) had already analysed data from a sample of STAR 
pupils through grades A and 5 in an attempt to determine whether 
gains STAR students achieved in small classes carried through to 
those grades. The LBS found that students who were in STAR small 
classes at least in grade 3 were statistically and educationally 
ahead of other STAR students. Three appendices include STAR data 
collection instruments, primary and extended analyses designs, and 
analysis of variance for cognitive outcomes. (Author/JPT) 
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THE LASTING BENEFITS STUDY (LBS) IN GRADES 4 AND 5 (1990-1991 
A LEGACY FROM TENNESSEE'S FOUR-YLAR {K-3) CLASS-SIZE 
STUDY (1985-1989), PROJECT STAR* 



ABSTRACT 

Education leaders in Tennessee commissioned a four-year (8/85-8/89) longitudinal experiment of 
class-size effects on pupil achievement in early primary grades (K-3). The project included over 7.000 
pupils/year in 79 schools In 42 school systems. There were three conditions: Small class (13-17); 
Regular class (22-26) and Regular class with n full-time teacher aide. Pupils were randomly assigned to 
class-size conditions; teachers were randomly assigned to classes. Pupils in small classes (1 :15) made 
significantly (statistically and educationally) greater gains than other pupils, and minority pupils in small 
classes benefited more than minority pupils in other class conditions. Gains initiated in kindergarten were 
maintained through third grade. Analyses showed a continuing, powerful class-size effect in all locations. 
There was no consistent teacher-aide effect evident in the analysis. This large-scale randomized 
experiment provided some definitive answers about class-size effects in earl\ primary grades. 

The LBS (a field study) has already analyzed data from a sample of STAR pupils through grades 
4 (n=4320) and 5 (n=4649). In LBS studenls who were In STAR small classes at least in grade 3 are 
statistically (p < .001 ) and educationally (eftect sizes about .15) ahead of students who were in STAR 
regular and regular-with-aide classes. This finding, In ail locations and for all pupils shows that at least 
two full years after returning to regular (1:25) classes the former small-class pupils continue to pertorm 
better than their peers from regular and regular-with-aide classes. Although the differences maintain 
statistical significance (p < .01) in grade 5, the absolute differences are some less than in grade 4. 
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THE LASTING BENEFITS STUDY (LBS) IN GRADES 4 AND 5: 
A LEGACY FROM TENNESSEE'S FOUR-YEAR CLASS-SIZE 
STUDY (1985-1989), PROJECT STAR 



Introduction. Ba-kgroim d and Perspective: The P.TAR Leaacv 

One issue that has caused considerable continuing debate is research to find a reasonable 
alternative to one-on-one tutoring to overcome what Bloom (1984) called the Two-Sigma Problem." A 
good start in schooling seems imperative if children are to succeed later. What is a reasonable class size 
for teaching elementary pupils? Do small (e.g., 1:15) classes work well for all children and better for some 
children? If pupils benefit from small classes in early grades (K-3), do those benefits continue in later 
schooling? 

From 1985-89 researchers from four universities and State Education Agency (SEA) personnel in 
Tennessee (TN) cooperated on a large, longitudinal, legislatively mandated experiment of effects on pupil 
achievement and development in grades K-3 of small class size (1 ;15). Called Project STAR, the study 
included over 7,000 pupils in 79 schools in 42 districts. 

For STAR, pupils were randomly assigned to a Small (S, or average of 15 pupils with 1 teacher - 
1 :15), a Regular (R, or average 1 :26), or a Regular with full-time teacher Aide (RA) class and stayed in 
that condition (K-3). Replacement due to pupil mobility was random. There were more than 100 
classes/year of each condition. 

Extensive test and demographic data were collected on pupils for all years and grades of STAR; 
extensive demographic, context and process data were also obtained for teachers, principals, school 
buildings and districts, teaching practices, and other things. A listing of specific data instruments is in 
Appendix A. 

Although the pupil was the primary unit of data collection (researchers conducted teacher 
interviews, etc. to support the class size analysis), the class was the unit of analysis (it was a study of 
class-size effects). This analysis recognized that each pupil is UQl an independent measure - teacher 
and classmates all influence pupil learning. Outcome measures were pupil results for achievemenf nn 
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cntenon-referenced tests (CRT) and on norm-referenced tests (NRT) and for develooment on such things 
as discipline, attendance, self-concept, and the SCAMIN self-concept test. 

Design Issues: STAR 

Sites participated for Iqlh years, and agreed to have some visitations and extra testing, and to 
allow random assionment of pupils and teachers to conditions . Sites had space for the added classes 
and at least 57 pupils in K. This excluded small schools, but at least 57 pupils were needed for the in- 
school design (minimum of 1 :13. 1 :22. 1 :22) so that any school with an S class also included R and RA 
class conditions. The powerful design helped ameliorate building-level variables such as leadership, 
curriculum, facilities, expenditures. SES. Researchers monitored testing conditions for consistency. 

The state paid for additional teachers and aides. STAR personnel made Qn\x class-size changes. 
Districts followed their own policies, curricula, etc. No pupil in STAR received less (i.e., had a 
disadvantage) by being in STAR. Not even/ pupil took every test or had every data point, but all pupils in 
an analysis had all dat a needed for that analysis . 

The general Multivariate (MULTIVARIANCE: Finn & Bock. 1985) design included four locations 
and the class type (S, R. RA) for either achievement measures or non-cognitive measures. The primary 
analyses addressed the questions required in the legislation for each of the four years. Additional 
analyses are underway. (Details appear in technical reports: Nye et al.. 1991 . 1992.) Themodelfor 
analyses is in Appendix B. The primary STAR analysis consisted of multivariate tests of mean differences 
between and among the groups being analyzed. The Lasting Benefits Study (or LBS) follows this design 
to the degree possible. Table 1 provides a summary of the STAR and LBS relationships. 



Table 1 about here 



The analysis employed a general linear model approach for unequal-n design. The design has 
unequal n* and some empty cells and requires multiple error terms to test all the fixed effects. Test 
statistics were the univariate F-ratlo for each measure and Wilks' likelihood ratio for multivariate sets. 
There were two planned contrasts tested among three class types: 
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S class mean vs all R and RA class means (S vs. "Other). 
R class mean vs. RA class mean. 
Summary Rfi<;ijlt<; 

The consistency - even monotony - of the STAR findings is significant. Pupils in S did 
statistically significantly better (usually at p ^ .001) than pupils in R and/or RA at every grade level (K-3). 
The class size ettect v^-as foimn ennallv in all locati o ns fe.o. urban, nirpi^ ^nd favored the S condifinn in 
four grade levgls. Some findings appeared in single grades, or in two of the four years. Appendix C 
contains major achievement results (K-3), Measures of development (e.g., self concept, attendance, 
discipline) showed no difference between S and R/RA in K-3. 

Some simple analyses demonstrated powerful effects. A strong positive class-size benefit for 
minority pupils appears in the percent of pupils passing the CRT (BSF) in grade 1. (This result was 
confirmed in multivariate analyses that shewed differences in Grade 1 by race but disappeared in later 
grades - Appendix C) Over 17% more minority pupils pass the BSF if the pupils are in S rather than in R 
(orRA). Note the results in Table 2. 
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The statistical significance question seems to be resolved in class-size issues. STAR results also 
addressed the '^educational" significance question. Often "educational" significance dorives from "effect 
sizes." Effect size shows hQw much the difference is relative to a standard deviation. With the CRT the 
educational effect might be estimated as the percent passing. Effect sizes favoring S in STAR range from 
.08 (in K) to .40 (in grade 3) for minority pupils. Generally the positive STAR effect sizes for pupils in S 
are in the .20 to .27 range. This effect size was obtained with no change pxr^p t class size. (Consider 
what might be possible with class size as the base from which to build education improvement.) 
The LBS Continup^ thfi STAR 1 pganv 

Weikart (1989) pointed out the lasting benefits of early intervention. What about lasting benefits 
of small classes? What happens when STAR pupils who benefited from S in K-3 return in grades 4 and 



later to "regular classes? The LBS analyses use pupil test scores and behavioral indicators of school 
effons In 1989 Tennessee changed from the CAT to the Tennessee Comprehensive Assessment 
Program (TCAP). The LBS uses TCAP data. The fourth-grade analysis included 4230 pupils. (They 
were identified by class type in at least grade 3.) Of those 1412 were S. 1250 were R and 1568 were RA. 
Grade 5 analyses included 4946 pupils distributed approximately the same. 

Scaled-score means tor the three STAR class types (S, R, RA) were compared using MANOVA 
for unequal n's (MULTIVARIANCE: Finn & Bock, 1985). Following the STAR analysis design, three 
achievement subsets were compared separately for the LBS. Two subsets included scores from both the 
NRT and CRT components of TCAP. Set 1 included Total Reading and Total Language (NRT scores) 
and the number of domains mastered in Language Arts (CRT). Set 2 consisted of Total Math and Total 
Science (NRT scores), and the number of domains mastered in Mathematics (CRT). Set 3 included 
Study Skills (NRT) and Social Science (NRT) scores. (See Finn et a!.. 1989/1992.) 
LBS Summan/ Results 

To date (9/92) the LBS analysis has yielded clear and consistent results. Grade 4 and grade 5 
students previously in a small-size STAR class demonstrated that they had statistically significant (at least 
p. < .01) advantages over pupils previously in R and RA on every set of measurements. The greatest 
achievement advantages in grade 4 were for inner-city and suburban classes (Tabic 3). No RA v R 
contrast was significant either year, showing the consistency (beginning in STAR. grad:3 1) of no gain by 
using teacher aides to improve pupil academic performance in primary grades. By grade 5 ihe location 
analysis was discontinued as some pupils had entered middle-school configurations. 
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The LBS students who had attended small STAR classes had a statistically significant advantage 
over LBS students who had attended R and RA STAR classes. The cx)sitive eflects from e?^rly 
involvement in a small-size class still remained oeo^asive two fui; vears after studg-t^s rpttjrned to regular - 
SiZg.CtasseS- Small-class students outperformed R and RA class students on every achievement 



measure in all locations as shown ty scaled score differences (Table 4), The trend, also begun in STAR, 
of pupils ironn R outperlorming pupils from RA classes is still evident in both grades 4 and 5. (Note that 
no R/RA comparison reached statistical signif.cance. but the trend has remained constant on all measures 
from grade 1 through grade 5.) 
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The differences in performances of pupils previously in S. R and RA in at least grade 3 and now 
in regular (e.g.. i :25) classes in grades 4 and 5 can also be shown in effect sizes (Table 5). Note the 
consistent (but small) positive advantage of former S pupils over former R pupils and the consistent 
positive advantage of former R pupils over former RA pupils on all measures (except the CRT for grade 5) 
in both grades 4 and 5. This supports the statistical significance finding and helps formulate the 
difference into gai>,s relative to a standard deviation. In grades 4 and 5 the former S pupils exceed the 
former R and RA pupils from .1 1 to .22 of a standard deviation. The R/RA contrast shows effect sizes 
ranging from -.02 to -.09 (Finn et a!.. 1989/1992; Nye et al.. 1991 , 1992). 
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A Report o n Pupil Partip ipf^finn 

Finn (1 989) noted that increased student participation in school reflected a decreasing tendency 
for student alienation and dropout in later years. Opportunities for student panicipation (e.g.. clubs, 
service projects, government, music, athletics) can be established by teachers and administrators. 
Panicipation also includes a pupil's involvement in classroom activity (e.g.. responding, asking questions, 
doing projects). Small classes should increase that involvement. 

Finn et al. assessed a flmd^ subset of STAR pupils by asking their teachers to rate them on 
the 25-item Pupil Panicipation Questionnaire on a f,ve-point range from (l) "never" to (5) "always." 
Teachers rated pupils on three behavioral scales (Finn et al„ 1989/1992. p. 78): . , .Nonpanicpatory 
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Behavior (e.g.. "Annoys or interleres with peers' work"). Minimally Adequate Effort (e.g., "Pays attention in 
class"), and Initiative Taking (e.g.. ^Does more than just the assigned work"). 

Teachers (n=258) rated pupils (n=2.207) in their classes who had participated in STAR (S, R. RA) 
conditions for three years (grades 1 -3). Using the LBS M ANOVA design, scores on the three participation 
scales - Effort. Initiative and Nonpaaicipatory Behavior were simultaneous criterion variables (p. 79). 
Statistically significant differences were found on participation variables: [Location (p ^ .05); Class type (p 
^ .0001); Loc X Type (p < .05)] (p. 79). According to Finn et al. (1989/1992). "Pupils who had aHended 
small classes were rated as having superior modes of participation in grade four in comparison to their 
peers" (p. 81). The participation effect sizes (.11 to .14) were similar to effect sizes found in LBS 
achievement analyses (.11 to .22) No RA/R contrast was significant. This LBS analysis links the desired 
participation behavior to higher pupil academic achievement on measures used in LBS. Building upon the 
STAR database. LBS research is showing that early small-class involvement (e.g.. 1 :15) has continuing 
benefits (note also Weikart, 1989). This counters some criticism of the qq^ of reduced class size, since 
benefits continue years after the class-size reduction. 
Discussion 

The STAR data provide the basis for a longitudinal study of the "Lasting Benefits" to a pupil of 
being in a small-class setting during early primary grades (K-3). Some discussion and conclusions seem 
appropriate. The power of the design and therefore the strength of the results and the confidence that 
one has in the findings/conclusions diminish as one moves from the experiment of STAR to the LBS field 

However. LBS results build on the strong STAR database, taking advantage of the longitudinal 
and randomized study. The STAR and LBS results help in determining ways that achievement can be 
improved in schools. 

Class-size reduction, as a treatment or inten/ention. is really an one-time event. That is. the 
treatment is when the student first experiences the reduction from regular (e.g.. 1 :26) to small (1 :15); the 
ensuing years are a CQntinuatiQfl> but not a separate treatment. Nevertheless, benefits once gained seem 
to last for several years if the pupil stays in 1 :1 5 (STAR) or goes to regular (LBS) classes. 
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The LBS results (graces 4 and 5) snow the cx)ntinuing benefits of a pupil's participation in the 
small class in early primary. Post hoc analyses of important elements of schooling other than 
achievement (e.g.. panicipation) suggest a small-class influence here. too. Continuing analyses through 
LBS may be able to add to information provided by other longitudinal studies (e.g.. Weikart, 1989) of 
important ^q£^ benefits of early interventions. Zigler (1992) claims mapr social benefits (e.g.. reduced 
c-ime and delinquency) and strong academic benefits for Head Stan. (Unfortunately, policy makers 
continue to reduce Headstart funding.) 

Since LBS shows continuing benefits in pupil achievement after small-class involvement, can 
small-class involvement for only one or two years (rather than STARs four years) provide a sound base to 
help pupils get staaed well in school? STAR results were strongest in K and 1 , suggesting that these 
should, at a minimum, be the years of the small-class intervention. The early primary heterogeneous 
classes provided by the STAR random assignment and STAR'S seeming ability to help minority pupils 
close the achievement gap are promising areas for LBS analyses. Although consistent in all STAR 
conditions (S. R. RA). pupil assignment in STAR (random) was different from regular pupil assignment 
practices. Did pupil random assinnmpni positively influence STAR results in all or in some STAR 
conditions'? Additional analyses of the STAR database may help unravel this interesting question. 

Results of STAR (the experiment) provide clear evidence of ways to improve schooling in early 
primary grades. Given the added needs of children entering schools in the 1990's (e.g.. Hamburg. 1992: 
Hodgkinson, 1991) the use of small classes may become imperative for later school success. We have 
found a way to improve schooling: do we have the wiH? The STAR experiment results have held up in 
field research and in policy conditions (e.g., LBS. Challenge) and are continuing to show added, 
continuous benefits How much evidence do leaders need before Ihey apply these strong findings to help 
improve schooling'? (See Zigler. 1992. a. so.) 

Education excellence must stan somewhere. Given the problems experienced by many new 
entrants to schooling and the increasing family dislocations (Hamburg. 1992: Hodgkinson. 1991), drastic 
class-size reductions in early grades seem to offer the best hope yet advanced. Small classes provide a 
new ^floor" from which educators can work. This is a positive indicator of a restmcturirig process that will 
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help pupils. (Interestingly, small classes probably don't harm teachers, either. Might small classes be 
one way to help with new teacher induction into the proiession? Might smaller classes help new teachers 
really learn to teach? Might the first year or two of teaching in a small class help retain in teaching some 
people who leave due to frustrations engendered by being put into leaching situations that are not the 
most desirable?) 

Should these and similar studies be seen simply as studies in class-size reduction? Perhaps they 
are better cast as trying to find the right class sizes to help solve Bloom's (1984) "two-sigma" problem 
trying to match the size of the instructional unit to the job to be done. The results clearly suggest one way 
to move from assembly-line, industrial-age schooling to case-load, information-age learning activities. 

Ways to accommodate this social "paradigm shift" are at the heart of the movement to restructure 
(redesign, reinvent) education. Class-size reduction is a restructuring process. So. while others still are 
seeking some "magic bullets" to aid in education restructuring, this research offers strong 
recommendations that at a minimum restructuring should start with small groups of youngsters in at least 
K-1 or K-2. neterogeneously grouped (STAR assignment was random), with a teacher (and no 
instructional aide) as the instructional unit. 
The Future 

Using the STAR/LBS database, researchers at the Center of Excellence for Research in the 
Basic Skills plan several new and continuing studies. The LBS analyses will continue as long as is 
feasible. Tennessee has initiated Project Challenge, a 1:15 event (K-3) in 17 of the state's 138 school 
systems. Researchers will monitor these efforts. 

Using the STAR/LBS database, researchers plan a variety of new studies. Some of these 

include: 

Homogeneous v. heterogeneous pupil assignments. 
Retention-in-grade questions and issues. 

Teacher behavior differences in different (S, R, RA) classroom environments, 

Class-size and achievement gap reduction between white and non-white STAR pupils (and 

the lasting benefits of this). 
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Discipline/panicipation measures (grade 7), 

Class-size/school size interamons, 
? 

When completed, this series of studies should tell educators much more atx)ut the issues of class 
size and pupil achievement and development than have the numerous, but considerably smaller and less 
experimental, prior studies. The various studies and compilations of class-size studies are found in such 
citations as: Cahen et al., 1983; ERS, 1978 and 1980; Folger. 1989/92; Glass & sm.th. 1978; Glass et al.. 
1982; Mitchell et al.. 1989; Mueller et al.. 1980; Slavin, 1989 and 1990; Tomiinson. 1988 and 1990; etc. 
The full STAR report appears in Ward et al., (1990) and the STAR and various LBS technical reports 

(e.g.. Nye et al.. 1991 and 1992) are available from the Center of Excellence, Tennessee State University. 
Orlich (1991) called STAR . .the most significant educational research dene in the US during 

the past 25 years" (p. 632). The LBS is extending STAR results as a basis for future education policy. 

Educators need to apply results of definitive research ["This research leaves no doubt that small classes 

have an advantage over large classes in reading and mathematics in the early primary grades" (Finn & 

Achilles. 1990. p. 573)] in making school organization decisions. 
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Table 1. Relationships of STAR and LBS Showing Years. Grades. Measurements, etc: 1985-1992. 



Study 
STAR'- 

LBS* 



Years 
1985-89 

1990-92 



Grades 
K-3 

1 grade/yr 
4-6 



Measurement 

Each year & 
longitudinal 

Each year 

Cognitive 

Particip. 
Grade 4 



Instruments 

SAT/BSF & 
questionnaires 



TCAP 



Questionnaire 



•Pupils progressed through the grades and were tested each year. 



Table 2. Average Percent of Pupils Passing BSF Reading: Grade 1. STAR. 



Status 



Grade 



Small 



Minority 

Non-Mino.nty 

Difference 



65.4% 
69.5% 
4.1% 



Reg. 



48% 

62.3% 

14.3% 



Difference 
(S-R) or 
(S) Advantage 



17.4 
7.2 
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Table 3. LBS Results. Grade 4 (1989-90) and Grade 5 (1990-91) on TCAP. Summary of Class Effects 
Analysis Using Mean Scores of Sets. 



Grade: 


Verbal 

4 


5 


Set 2 
Math/Sci 
4 5 


Set 3 
Soc Sci/Study 
4 5 


Loc. (urban, etc.) 


P<.001 


N/A 


p<.001 


N/A 


ps-001 N/A 


Type (S.R.RA) 


ps.OOl 


P£.Ol 


P£.001 


P^.01 


P£.001 P:S.01 


Loc X Type 


NS 


N/A 


NS 


N/A 


NS N/A 



(Results found in all locations equally) 



Loc. differences on all sets favoring S in the location, but major difference is due mostly to lower- 
performing inner-city pupils. Type differences favors. R vs RA contrasts NS. Loc X Type class-type 
differences are the same in all locations. 



Tao.e 4. LBS: Grades 4 and 5. TCAP. Scaled Score Differences and the Differences in Mean Number 
of Domains Mastered between S and R Class Students and between RA and R Class Students Means 
are tabled in Appendix B of the Technical Repon (Nye et al.. 1991 . 1992). 



Measures 
NRT- 


1989-90 f4:h) 
S vs R RA V R 


1990-91 I5\h) 
SvsR RAvR 


Total Reading 


5.61 


-2.23 


10.53 


.10 


Total Language 


4.99 


-.73 


8.21 


-1.03 


Total Math 


4.87 


-2.29 


8.08 


-.34 


Science 


5.69 


-1.47 


8.99 


-2.66 


Social Sciences 


6.13 


-.20 


8.14 


-1.31 


Study Skills 


10.10 


-2.15 


10.62 


-.85 


CRT (Domains Mastered)' 










Language Arts: 


.25 


-.18 


.84 


.07 


Mathematics: 


.35 


-.09 


.68 


.16 



•S consistently better than R; RA consistently less good than R (-). 
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Table 5. LBS: Grades 4 and 5, 1989-90; 90-91. TCAP. Estimates of S and RA Effect Sizes. 



ERIC 



Measures 1989-90 (4th) 1990-91 fSth) 
NRr S V R RA V R S vR RA v R 

Total Reading .13 -.05 .22 .00 

Total Language .13 -.02 .18 -.02 

Total Math .12 -.06 .18 -.01 

Science .12 -.03 .17 -.05 

Social Science .11 -.04 .17 -.03 

Study Skills .14 -.03 .18 -.01 

CRT (Percent differences)* 

Language Arts .11 -.09 .34 .03 

Mathematics .16 -.04 .28 .07 

*S consistently better than R: RA consistently lower than R excepton CRT, grade 5. 
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Appendix A 

DATA COLLECTION INSTRUMENTS: STAR. 1985-1989 

1 Profiles Data collecied include: 

System - Enrollment, total expenditures per student, location, etc. 
School: Type. Size. type ol community served, special programs, etc. 
Pnncipal : Age. sex. race, education, experience, etc.. 

Teacher. Age. sex. race, education, certification, experience, career ladder level, 
attendance, etc. 

A'de; Age. sex, race, education, experience as an aide. 

Proiect Student: Age. sex. race. SES. special educaiion programs. 

Comparison Student - Age. sex, race, and SES. 

2- Stantprg Early School A chievement Test fSESAT in and other forms of SAT tn mc^'A^^urf^ 

pupil achievement in math and reading/language arts, based on national norms. 

3- Seif'Conceot and Motivation Inven tory fSCAMIN) to measure elements of academic self- 

concept and academic motivation. 

Basic Skills Masten/ (BSF). A curriculum-based criterion-referenced test to measure 
mastery of objectives in grades 1 , 2, and 3. 

5- Groucino Questionnaire to study how teachers regularly divide students into groups for 
instruction. 

6. Parentn~e3Cher interaction OuestiQnnpirft to determine the amount of time teachers spend 
interacting with parents during a school year. 

Teacher/Problem Checklist (Cruickshank) to measure teacher perceived problems related 
to class size and pupil/teacher ratio. 

8. Teacher Log provides a self-reporied use of school time (also Aide log). 

9 Aide Questionnaire to obtain basic information regarding aides' sjpervision. job 

description and training. 

10 Exit Intea'ews to obtain teacher perceptions penment to the project. 
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APPENDIX B 

Primary and Extended Analyses Designs: STAR (1985-1989): LBS 1990-1992. 



Sample Design: 



4 Locations (Urban, njral, etc.) (Fixed Effect) 

Schools nested in Locations (Random Effect) 

Class types (S.R.RA) crossed with (Fixed Effect) 

locations and school types 

2 Training categories' (Fixed) 



Source Table 



Source of Vari;^tion: Error Term : 

Location (L) Schools 

Training* (TR) Schools 

"^ype (T) School X type 

^-^T School X type 

1->^TR School 

TxTR School X type 

LxTxTR School x type 



Degrees of freedom (df) 
Ach. Meas. Noncog. Meas, 
Schools e.g. (1986) 75 69 

School X Type e.g. (1987) 149 137 

Classes within School-Types (etc.) 



Primary fvlodel: Measures 

Achievement (Ach): SESAT. SAT. BSF Matched 

Noncognitive (Noncog): SCAMIN. Attendance, t-tests 

Behavior, etc. 

Extended Model: Mep^sures : 

Sex (or Race, or SES) Ave. Diff Scores on Ach. Multivariate 

Sex (or Rdce, or SES) Ave. Diff. Scores on Noncog. Models 
Training* 

Two planned contrasts: S class mean vs means of all R and RA; S vs ^R + ra 2) 

RA class mean vs R class mean. 

Each effect tested holding constant earlier effects in order of elimination. TR and T each tested as last 
main effect: LxTR and LxT each tested as iast two-way interaction. 

Ar.alysis of BSF done with "log-odds index." 

Tor grades 2 and 3. a random subset of schools was chosen to study the effects, if any. of teacher 
training (TR) on pupil outcomes. The training used had no significant effect. 
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APPENDIX C 



Analysis of Variance for Cognitive Outcomes, STAR, Grades K-3. 
Sig. Levels p<.05 or Greater are Tabled. 









Reading 






Mathematics 




Eftect/3 




Multi- 


SAT^ 


BSF 


Ml ilti- 

IVI U III 




Dor 


Grade 




vanate 


^ Read 


Read 


variate 


b Math 


Math 


Location (L) 


K 




.02 






.05 






1 


.(J 1 


.06 




.05 








2 


.001 


.001 


.001 




.001 


.001 








.UU 1 


AA 1 
.UU1 


.001 


.001 


.001 




i 

J 


.001 


.001 


.001 


.001 


.001 


.001 




2 


.001 


.001 


.001 


.001 


.001 


.001 


TvoefT^ 


K 




.001 






.02 






1 


.001 


.001 


.001 


.001 


.001 


.05 




o 
iL 


.001 


.001 


.05 


.001 


.001 


.05 




3 


.001 


.001 


.001 


.001 


.001 


.001 


SES 


K 




.001 






.02 








.05 




.05 








Loc X Type 


K-3 


All N/S. 


The class-size effect is found equally in all 


locations -- Inner 








City. Suburban. Urban and Rural schools. 


(Tabled as innportant.) 




Race X Type 


1 


.05 


.05 


.01 








LxRxT 


1 






.05 






.01 


LxTRxT 


2 


.05 


.01 


.05 


.05 


.05 


.01 



NOTE: Only statistically significant (<.05)results are shown, a The nonorlhagonal design required 
tests in several orders (Finn and Bock, 1985). Results were obtained as follows- each main effect was 
tested elinnmating both other main effects; loc x race tested eliminating main effects and loc x type- loc x 
type tested eliminating main effects and loc x race: race x type tested eliminating main effects and'other 
two-way interaaions. and loc x race x type tested eliminating all else (Finn and Achilles 1390) t) 
Obtained from F-approximation from Wilks' likelihood ratio. Essentially, no statistically significant 
differences were obtained on the self-concept and/or motivation (SCAMIN) measures. 

This table, the table in Appendix B and tables in the text have appeared in other articles reoorls and 
papers that discuss STAR results. ' ^ 
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